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1 Introduction 

While no direct evidence of physics beyond the standard model (SM) has yet been found at the 
CERN LHC, the B° meson provides a rich source of possibilities to probe its consistency In this 
Letter, a measurement of the weak phase (p s of the B 11 meson and the decay width difference 
Ar s between the light and heavy Bi? mass eigenstates is presented, using the data collected 
by the CMS experiment in pp collisions at the LHC with a centre-of-mass energy of 8TeV, 
corresponding to an integrated luminosity of 19.7 fb 1 . 

The CP-violating weak phase <p s originates from the interference between direct B® meson de¬ 
cays into a CP eigenstate ccss and decays through Bj?-Bg mixing to the same final state. Ne¬ 
glecting penguin diagram contributions HU El, (p s is related to the elements of the Cabibbo- 
Kobayashi-Maskawa quark mixing matrix by cf) s ~ —2 (5 S , where /3 S = argf — i/ ls V% / i/ C s Pcb) • 
The prediction for 2/3 s , determined via a global fit to experimental data within the SM, is 
2/3 s = 0.0363 0015 rac * El- Since the value predicted by the SM is very precise, any signifi¬ 

cant deviation of the measured value from this prediction would be particularly interesting, 
as it would indicate a possible contribution of new, unknown particles to the loop diagrams 
describing B® mixing. The theoretical prediction for the decay width difference AT S between 
the light and heavy B® mass eigenstates Bl and Bh, assuming no new physics in B^-Bg mixing, 
is AT s = T l - T H = 0.087 ± 0.021 ps' 1 H. 

The weak phase (p s was first measured by the Tevatron experiments ISj-fS}, and then at the LHC 
by the LHCb and ATLAS experiments 19U131, using Bg —* J/i/u/>(1020), Bg —> J/rp fo(980), and 
Bg —>■ J/xp n 1 7i decays to £ + , where i denotes a muon in the present analysis and h 
stands for a kaon or a pion. Pinal states that do not have a single CP eigenvalue require an 
angular analysis to disentangle the CP-odd and CP-even components. The time-dependent 
angular analysis can be performed by measuring the decay angles of the final-state particles 
l + i~\\ + h~ and the proper decay time of the B® multiplied by the speed of light Iil4| , referred to 
as ct in what follows. In this Letter, the B 11 —» J /ip cp(1020) decay to the final state y 1 }i K 1 K 
is analysed, and possible additional contributions to the result from the nonresonant decay 
Bg —> J /if) K K ~ are taken into account by including a term for an additional amplitude (S- 
wave) in the fit. 

In this measurement the transversity basis is used Ifl4l . The three angles 0 = (9j, ip T , cp T ) of 
the transversity basis are illustrated in Fig. [Tj The angles Qj and (pj are the polar and azimuthal 
angles, respectively, of the }i + in the rest frame of the J/ip where the x axis is defined by the 
direction of the p(1020) meson in the J/jp rest frame, and the x-y plane is defined by the decay 
plane of the tp (1020) —> K 1 K . The helicity angle ipj is the angle of the K + in the cf> { 1020) rest 
frame with respect to the negative J/tp momentum direction. 

The differential decay rate of Bg —> J/tp cp(1020) is represented using the function /(0, ct, oc) as 
in Ref. Ifl5l : 

d 4 T('B 0 ') ^ 

dQ ^ c 8 ^ = /(©/ ct > a ) a £ Of i ct '(!) 


where O,- are time-dependent functions, gj are angular functions, and a is a set of physics pa¬ 
rameters. 

The functions 0;(cf, a) are: 


fljcosh 



+ bi sinh 



+ Cj cos (A m s t) + d{ sin (A m s t) , 


Oj(ct,K) = N^ ct/cT 
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1 Introduction 



Figure V. Definition of the three angles 9j, ipj, and cpj describing the decay topology of B° —> 
J/i p cp( 1020). See text for details. 

where A m s is the mass difference between the heavy and light B® mass eigenstates, cr is defined 
as the product of the lifetime and the speed of light, the function g,-(0) and the terms N„ a,, hi, 
Cj, and d, are given in Table [l] 


Table 1: Angular and time-dependent terms of the signal model. 
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The terms C, S, and D are defined as: 

1-|A| 2 c _ 21A| sin0 S 2|A|cos<£ s 

1 + |A| 2 ' 1 + |A| 2 1 + |A| 2 

using the same sign convention as the LHCb experiment IflOl . Equation ([Tj) represents the 
model for Bj?. The model for B[? is obtained by changing the sign of the c, and d, terms. The 
parameters |A^| 2 , |Ao| 2 , and /ly | 2 are the magnitudes squared of the perpendicular, longi¬ 
tudinal, and parallel P-wave amplitudes, respectively; |As| 2 is the magnitude squared of the 
S-wave amplitude representing the fraction of nonresonant decay Bg —> J /if) K K “; the param¬ 
eters 5 l. So, S |, and S$ are their corresponding strong phases. 

The complex parameters A f are defined as A j = (q/p)(Af/ Af), where the amplitudes Aj- 
(Af) describe the decay of a B° (B°) meson to a final state /, and the parameters p and q relate 
the mass and flavour eigenstates as Bh = pBg — qB® and Bl = pBg + qB® Ifl6l . Assuming 
polarisation-independent CP-violation effects, A f can be simplified as A j = iff A, where iff is 
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the CP eigenvalue. The amount of CP violation in mixing is assumed to be negligible 
Thus, no \cj/p\ terms are used in Eq. ([Tj) when going from the Bl 1 model to the Bj? model. Since 
direct CP violation is expected to be small theoretically El and is measured to be small |9|, |A| 
is set to 1.0. 

2 The CMS detector 

The central feature of the CMS apparatus is a 13 m long superconducting solenoid of 6 m in¬ 
ternal diameter, providing a magnetic field of 3.8 T. Within the solenoid volume are a silicon 
pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter, and a brass and 
scintillator hadron calorimeter, each composed of a barrel and two endcap sections. Muons 
are measured in gas-ionisation detectors embedded in the steel flux-return yoke outside the 
solenoid. Extensive forward calorimetry complements the coverage provided by the barrel 
and endcap detectors. 

The main subdetectors used for the present analysis are the silicon tracker and the muon de¬ 
tection system. The silicon tracker measures charged particles within the pseudorapidity range 
\t ]| < 2.5. It consists of 66 million 100x150 pm 2 silicon pixels and more than 9 million silicon 
strips. For nonisolated particles of transverse momentum 1 < pj < lOGeV and \rj\ < 1.4, the 
track resolutions are typically 1.5% in pj and 25-90 (45-150) pm in the transverse (longitudinal) 
impact parameter fl8l . 

Muons are measured in the pseudorapidity range \rj\ < 2.4, with detection planes made using 
three technologies: drift tubes, cathode strip chambers, and resistive plate chambers. The rel¬ 
ative pj resolution for low transverse momentum muons with pj < 10 GeV is between 0.8% 
and 3.0% depending on |?/| |l9l . 

The first level (LI) of the CMS trigger system, composed of custom hardware processors, uses 
information from the calorimeters and muon detectors to select the most interesting events in a 
fixed time interval of less than 4 ps. The high-level trigger (HLT) processor farm further reduces 
the event rate from around 100 kHz to around 400 Hz, before data storage. At the HLT stage 
there is full access to all the event information, including tracking, and therefore selections 
similar to those applied offline can be used. 

A more detailed description of the CMS detector, together with a definition of the coordinate 
system used and the relevant kinematic variables, can be found in Ref. ||20| . 

3 Event selection and simulated samples 

A trigger optimised for the detection of B hadrons decaying to J/ip is used to collect the data 
sample. The LI trigger used in this analysis requires two muons, each with pj greater than 
3GeV and \t]\ < 2.1. The HLT requires a J/ip candidate displaced from the luminous region. 
Each muon pj is required to be at least 4 GeV and the pj of the reconstructed muon pair must 
be greater than 6.9 GeV. The J/xp candidates are reconstructed from the muon pairs selected by 
the trigger in the invariant mass window 2.9-3.3 GeV. The three-dimensional (3D) distance of 
closest approach of the two muons to each other is required to be smaller than 0.5 cm. The two 
muon trajectories are fitted to a common decay vertex. The transverse decay length significance 
L xlj ! a Lxy is required to be greater than 3, where L xy is the distance between the centre of the lu¬ 
minous region and the secondary vertex in the transverse plane, and Cl is the L xy uncertainty. 
The secondary-vertex fit probability, calculated using the y 2 and the number of degrees of free¬ 
dom of the vertex fit, must be greater than 10%. The angle p between the dimuon transverse 
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3 Event selection and simulated samples 


momentum and the L XXJ direction is required to satisfy cos p > 0.9. 

Offline selection criteria are applied to the sample. The individual muon candidates are re¬ 
quired to lie within a kinematic acceptance region of pj > 4GeV and \t]\ < 2.1. Two oppo¬ 
sitely charged muon candidates are paired and required to originate from a common vertex. 
Dimuon candidates with invariant mass within 150 MeV of the world-average J/0 mass Il2ll 
are selected. Candidate 0(1020) mesons are reconstructed from pairs of oppositely charged 
tracks with pj > 0.7 GeV, after removing the muon candidate tracks forming the J/0. Each 
selected track is assumed to be a kaon, and the invariant mass of a track pair is required to be 
within 10 MeV of the world-average 0(1020) mass lf2T| . 

The Bj? candidates are formed by combining J /ip and 0(1020) candidates. A kinematic fit of 
the two muon and two kaon candidates is performed, with a common vertex, and the dimuon 
invariant mass is constrained to the nominal J/0 mass l2TI . A B) 1 candidate is retained if the 
J/0 0(1020) pair has an invariant mass between 5.20 and 5.65 GeV and the y 2 vertex fit proba¬ 
bility is greater than 2%. 

Multiple pp collisions can occur in the same beam crossing (pileup). The average number of 
primary vertices in an event is approximately 16, and each selected event is required to have 
at least one reconstructed primary vertex. If there are multiple vertices, the one that minimises 
the angle between the flight direction and the momentum of the B) 1 is selected. The selected 
primary vertex is used to measure ct. The quantity ct is calculated from the transverse decay 

j gO gQ j jgO gQ 

length vector of the Bg, L xy , as ct = "VdcAw ' Pt^Pt' where m pDG is the world-average B|) 
mass 12D and p T is the B) 1 transverse momentum vector. The decay length is calculated in the 
transverse plane to minimise effects due to pileup. 

Simulated events are produced using the pythia v 6.424 Monte Carlo event generator [[22]. The 
B hadron decays are modelled with the EVTGEN simulation package |[23] . For the B: 1 signal gen¬ 
eration, the evtpvvcplh module is used, which simulates the double vector decay taking into 
account neutral meson mixing and CP-violating time-dependent asymmetries. Final-state radi¬ 
ation is included in EVTGEN through the PHOTOS package 112411251 . The events are then passed 
through a detailed GEANT4-based simulation |26l of the CMS detector. The predicted distri¬ 
butions from simulation of many kinematic and geometric variables are compared to those 
from data and found to be in agreement. The simulated samples are used to determine the sig¬ 
nal reconstruction efficiencies, and to study the background components in the B(? signal mass 
window. 

The main background for the Bg —> J/0 0(1020) decays originates from nonprompt J/0 mesons 
from the decay of B hadrons, such as B°, B^, A b , and B c . Since the B c cross section is small |21| 
compared to that of the B ( ) 1121] . the B c decays can be neglected. The contribution of the 
Ab —> J/0X channels to the selected events is also found to be small, and its mass distribu¬ 
tion in the selected mass range is observed to be flat. The effect of background with a similar 
signal signature on the physics observables is studied using simulated events, and found to be 
negligible. The mass distribution in the signal region is shown in Fig.[2j and the distribution of 
ct and its uncertainty a ct in Fig. [3] 

Efficiency corrections owing to the detector acceptance, trigger selection, and selection criteria 
applied in the data analysis are taken into account in the modelling of the angular observables. 
The angular efficiency e(0) is calculated using a fully simulated sample of Bg —» J/0 0(1020) —> 
p + p~K + K~ decays. In this sample, the AT S parameter is set to zero to avoid correlations be¬ 
tween the decay time and the angular variables. The e(0) is fitted to a 3D function of © to 
properly account for the correlation among the angular observables. 
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Figure 2: The J/i/’K K invariant mass distribution of the Bj? candidates. The solid line is a fit 
to the data (solid markers), the dashed line is the signal component and the dot-dashed line is 
the background component. 



B° proper decay length [cm] Proper decay length uncertainty [cm] 


Figure 3: The ct distribution (left) and its uncertainty a ct (right) of the B° candidates. The 
solid line is a fit to the data (solid markers), the dashed line is the signal component and the 
dot-dashed line is the background component. For the ct distribution the pull, defined as the 
difference between the observed events and the fit function applied to the sum of the signal 
and background, divided by the statistical uncertainty in the observed events, is displayed in 
the histogram in the lower panel. 

The trigger includes a decay length significance requirement for the J/i p candidates. Accord¬ 
ingly, the value of ct is required to be greater than 200 }im in order to avoid a lifetime bias 
coming from the turn-on curve of the trigger efficiency. The efficiency histogram of ct is then 
fitted with a straight line plus a sigmoid function. 
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4 Flavour tagging 


4 Flavour tagging 

The flavour of each B|? candidate at production time is determined with an opposite-side (OS) 
flavour tagging algorithm. Since b quarks are produced as bb pairs, the flavour of the signal B® 
meson at production time can be inferred from the flavour of the other B hadron in the event. 
The tagging algorithm used in this analysis requires an additional muon or electron in the 
events containing a reconstructed B° —» ]/xp cp (1020) decay. The additional lepton is assumed 
to originate from a semileptonic decay of the OS B hadron, b —» ivX decay, with £ = e, p. For 
all the events in which an OS tag lepton is found the algorithm provides a tag decision £ based 
on the charge of the lepton: £ = +1 for signal Bj?, and £ = — 1 for signal Bg. 

The tag decision is affected by processes that reverse the charge-flavour correlation, such as 
cascade decays b — > c — > l, or semileptonic decays of neutral OS B mesons that have oscillated 
to their antiparticles before decaying. Leptons produced from flavour-uncorrelated sources, 
such as semileptonic decays of promptly produced charmed hadrons, pion and kaon decays, 
j/rp decays, and Dalitz decays of neutral pions further contribute to diluting the tag informa¬ 
tion. The probability of assigning a wrong flavour to the signal Bg is described by the mistag 
probability co, defined as the ratio of the number of wrongly tagged events divided by the total 
number of tagged events, which is directly related to the dilution factor D = (1 — 2co). The 
value of co is estimated from data on a per-event basis, as described below. 

The tagging algorithm is optimised by maximising the tagging power 'P ta g = £ tag (l — 2m) 2 , 
which represents the equivalent efficiency of a sample with perfect tagging (co = 0). The term 
£tag is the tagging efficiency, defined as the fraction of events to which a tag decision is found 
by the tagging algorithm. 

Opposite-side muons and electrons are reconstructed with the particle-flow algorithm Ii27l l28il. 
In each event, the muons (electrons) that are not part of the reconstructed Bg —> J/ip (p( 1020) de¬ 
cay are required to be identified with loose identification criteria. If there are multiple muons 
(electrons) in the event, the one with the highest pj is chosen at this stage. The tag lepton se¬ 
lections are then optimised separately for muons and electrons using simulated signal samples 
of B° —» j/p (p (1020) decays. A cut-based opposite-side lepton selection is applied to reduce 
the number of leptons not originating from B-hadron decays. To optimise the selection, sev¬ 
eral variables are studied, and a set of five discriminating variables (pi, rj, d xyz , AR, Isolation) 
is identified. A total number of more than four million alternative cut configurations have 
been tested to determine the configuration that maximises the tagging power, independently 
for muons and electrons. The tag muon is thus required to have pj > 2.2 GeV, the 3D impact 
parameter d xyz with respect to the primary vertex associated with the signal B 1 - 1 is required to 

be smaller than 0.1 cm, and the angular separation, A R = V (A (p) 2 + (A rj) 2 , between the muon 
and the signal Bg is required to be greater than 0.3, where A cp and A// are the azimuthal angle 
and pseudorapidity differences between the directions of the tag muon and the Bg candidate. 
Electrons are required to have pj > 2.0 GeV, d xyz < 0.1cm, and AR > 0.2. In addition, a 
multivariate discriminator (MVAg-^) tuned to separate genuine electrons reconstructed by the 
particle-flow algorithm from pions and photons is applied to tag electrons by requiring that the 
discriminator is greater than 0.2 ESI . 

A multilayer perceptron neural network (MLP-NN) of the TMVA toolkit E9l is used to fur¬ 
ther separate the right- and wrong-tag leptons. Training and testing is performed using ap¬ 
proximately 24000 and 20400 simulated Bg —> J/(/> cp( \ 020) events for the muon and electron 
MLP-NNs, respectively, and two independently optimised sets of variables. Half of each sam¬ 
ple is used for training and the other half for testing. The input variables common to both 
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MLP-NNs are pj, )], and d xyz of the tag lepton, and two variables related to activity in a cone 
around the lepton direction: a particle-flow relative isolation variable I128H and a px-weighted 
average of the charges of the particles in the cone. Specific variables are further introduced in 
the MLP-NNs separately for muons and electrons. For muons, the pj relative to the axis of the 
jet containing the muon is used, while for electrons the M VA e _ n is exploited. 

The mistag probabilities are obtained from data using the self-tagging channel B ± —> J / (/ > K , 
where the charge of the reconstructed kaon determines the flavour of the EL and, in the absence 
of mixing, of the opposite-side B hadron as well. The mistag probabilities are parametrised 
separately for muons and electrons with analytic functions of the MLP-NN discriminators in 
order to provide a per-event value of the predicted mistag probability go. The functional forms 
of the parametrisations are obtained from the simulated B(? sample. The candidate B ; mesons 
are required to pass a selection as similar as possible to that applied for the reconstruction of the 
signal B® candidates. The same trigger and J /ip reconstruction requirements as for the Bj? signal 
sample are applied. A charged particle with pj > 2 GeV, assumed to be a kaon, is combined 
with the dimuon pair in a kinematic fit. An unbinned extended maximum-likelihood fit to the 
invariant J/i/^KA mass is performed, yielding a total of (707 ± 2) x 10 3 reconstructed B~ —> 
]/^K ± events. The tagging efficiency evaluated with the B ± —» data sample is (4.56 ± 

0.02)% and (3.92 ± 0.02)% for muons and electrons, respectively, where the uncertainties are 
statistical. 

The mistag parametrisation curves evaluated with the B ~ control channel for muons and elec¬ 
trons are shown in Fig.jiJ where the parametrisations for the B ; and B; 1 simulated samples are 
shown for comparison. 

Most tagged events have only a single electron or muon tag. If both tags are available for a 
specific event (about 3.5% of the cases), the tag lepton with the greatest value of the dilution 
factor is retained, and the tag decision and the estimated mistag are taken from this tag lepton. 
The overall lepton tagging efficiency is (8.31 ± 0.03)%, as measured in data with the B~ —» 
J/ip K ± data sample. 

To correct for potential effects induced by the dependence of the tagging algorithm on the 
Eg —t ]/ip cp (1020) simulation, the mistag probability is calibrated by comparing the per-event 
predicted go to the measured te meas obtained from the B : —> J/ip KL data control channel. This 
is then fit to the function <u meas = po + p \ (co - go 1 ), chosen to limit the correlation between the 
function parameters po and p\. The parameter go' is fixed to a value roughly corresponding 
to the mean of the calculated mistag probability, go' = 0.35. The resulting calibration param¬ 
eters are po = 0.348 ± 0.003 and pi = 1.01 ± 0.03, and their uncertainties are propagated as a 
statistical uncertainty in the OS tagger. 

The systematic uncertainties related to the calibration parameters po and pi are dominated by 
the dependence of these parameters on the flavour of the signal-side B hadron. The uncer¬ 
tainties are estimated from B^ data and simulated samples of B 3 and B* events. Systematic 
uncertainties originating from possible variations in the CMS data-taking conditions, the sig¬ 
nal B hadron kinematics, the analytic form of the mistag parametrisation functions, and the 
model used to fit the B~ invariant mass distribution are tested and found to be negligible. 

The overall tagging power of the OS lepton tagger, measured with a sample of B 31 —> j/ipK^ 
events, is Vtag = (1.307 ± 0.031 (stat) ± 0.007 (syst)) %, corresponding to the combined mistag 
probability go = (30.17 ± 0.24 (stat) ± 0.05 (syst)) %. 
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5 Maximum-likelihood fit 




Figure 4: The mistag probabilities co, defined as the ratio of the number of wrongly tagged 
events divided by the total number of tagged events, as a function of the MLP-NN discrim¬ 
inators for muons (left) and electrons (right). The data points (solid markers) are placed at 
the average weighted value of the events in each bin. The vertical bars show the statistical 
uncertainties and the horizontal bars the bin width. The solid line represents the parametrisa- 
tion curve extracted from the background-subtracted B 21 data; the dashed and dot-dashed lines 
refer to the parametrisations extracted from the simulated B® and B ; samples, respectively. 

5 Maximum-likelihood fit 

An unbinned maximum-likelihood fit to the data is performed by including the information 
on the B° invariant mass (m B o), the three decay angles (0) of the reconstructed B, candidates, 
the flavour tag decision (£), ct, and a c t, obtained by summing in quadrature the decay length 
uncertainty and the uncertainty in the transverse momentum. The fit is applied to the sample 
of 70 500 events, out of which 5 650 are tagged events, selected in the mass range 5.24-5.49 GeV 
and ct = 200-3 000 }im. From this multidimensional fit, the physics parameters of interest 
Ar s , cp s , the Bg mean lifetime ct, | 2 , |y\o| 2 , As | 2 , and the strong phases d , 5_, and 
are determined, where is defined as the difference — S±. The P-wave amplitudes are 
normalised to unity by constraining | An | 2 to 1 — \A_ | 2 — A)| 2 - The fit model is validated with 
simulated pseudo-experiments and with simulated samples with different parameter sets. 

The likelihood function is composed of probability density functions (pdf) describing the signal 
and background components. The likelihood fit algorithm is implemented using the RooFit 
package from the ROOT framework l30l . The signal and background pdfs are formed as the 
product of pdfs that model the invariant mass distribution and the time-dependent decay rates 
of the reconstructed candidates. In addition, the signal pdf also includes the efficiency function. 
The event likelihood function C is represented as: 

L, Ts T Tbkg/ 

L S = N S [f (0,cf,a) <g> G (ct,a ct ) e (©)] P s (m B o) P s (a ct ) P s (£), 

Akg As kg A kg (cOS 6j, (pj ) Rbkg (COS tpT ) Pb kg(^0 Akg(^B^) A>kg (Act) A>kg((?)/ 

where L s and Lbkg are the pdfs that describe the B° —> ]/xp (p(1020) signal and background 
contributions, respectively. The number of signal (background) events is N s (Nbkg)- The pdf 
/(0,cf, a) is the differential decay rate function /(O, ct, a) defined in Eq. |l]l, modified to in- 
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elude the flavour tagging information and the dilution term (1 — 2 <x>), which are applied to 
each of the C; and d, terms of the equation. In the / expression, the value of <30 is set to zero, 
following a general convention. The function e(0) is the angular efficiency and G(ct,a c t ) is a 
Gaussian resolution function, which makes use of the event-by-event decay time uncertainty 
<7 C f, scaled by a factor k. The k factor is a function of ct and is introduced as a correction to take 
care of residual effects when the decay time uncertainty is used to model the ct resolution. The 
function 7c(cf) is measured using simulated samples and, on average, its value equals 1.0 to 
within a few percent. The average decay time uncertainty including the x(ct) factor equals 23.4 
pm. All the parameters of the pdfs are left free to float in the final fit, unless explicitly stated 
otherwise. The value of AT S is constrained to be positive, based on recent measurements 13JJ. 

The signal mass pdf P s (m B o) is the sum of three Gaussian functions with a common mean; 
the two smaller widths, the mean, and the fraction of each Gaussian function are fixed to the 
values obtained in a one-dimensional mass fit. The background mass distribution Pbkg( m B?) 
is described by an exponential function. The background decay time component Pbkg( c 0 is 
described by the sum of two exponential functions. The angular parts of the backgrounds pdfs 
Pbkg ( cos 0 t, <Pt ) arid Pbk g (cos ipj ) are described analytically by a series of Legendre polynomials 
for cos$t and cos ipj and sinusoidal functions for cpj. For the cos 6j and cpj variables a two- 
dimensional pdf is used to take into account the correlation among the variables. 

The signal decay time uncertainty pdf P s (c r C f) is a sum of two Gamma functions, with all the 
parameters fixed to the values obtained by fitting a sample of background-subtracted events. 
The background decay time uncertainty pdf Pbkg(cG) is represented by a Gamma function. 
All the parameters are fixed to the values obtained by fitting the Bj? invariant mass sideband 
regions, defined by the mass ranges m B q = 5.24-5.28 GeV and 5.45-5.49 GeV. The functions 
P s (£) and Pbkg(£) are the tag decision £ pdfs, which have been obtained from the data. 


6 Results and systematic uncertainties 

The results of the fit are given in Table[2j where the quoted uncertainties are statistical only. The 
corresponding correlation matrix for the statistical uncertainties in the physics fit parameters 
is shown in Table [ 3 ] Since the likelihood profiles of 8\\, 5$^, and 7l s | 2 are not parabolic, the 
statistical uncertainties quoted for these parameters are found from the increase in — log C by 
0.5. In the fit, the value of A m s is allowed to vary following a Gaussian distribution with mean 
and standard deviation set to (17.69 ± 0.08) x 10 12 h/s H32) . As a cross-check, the Am s value 
is also left free to float and its best fit value is found to be in statistical agreement with the 
set value. The various data distributions and the fit projections are shown in Figs. |2j|3j and [5] 
The drop in the cos dj distribution at the range limits is identified as being caused by close-by, 
high-angle kaon tracks. The central value and the 68%, 90%, and 95% confidence level (CL) 
likelihood contours of the fit in the A T s -cj) s plane are shown in Fig. [6] 

Several sources of systematic uncertainties in the primary measured quantities are investigated 
by testing the various assumptions made in the fit model and those associated with the fit 
procedure. 

The systematic uncertainty associated with the assumption of a constant efficiency as a function 
of ct is evaluated by fitting the data with an alternative ct efficiency parametrisation, which 
takes into account a small contribution of the decay time significance requirement at small ct 
and first-order polynomial variations at high ct. The differences found in the fit results with 
respect to the nominal fit are used as systematic uncertainties. 
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6 Results and systematic uncertainties 


Table 2: Results of the fit to the data. Uncertainties are statistical only. 


Parameter 

Fit result 

<j Ps [rad] 

AT s [ps" 1 ] 
|Ao| 2 

|A S | 2 

|AJ 2 

<S|| [rad] 

^s_L [rad] 

<S_i_ [rad] 
ct [pm] 

-0.075 ± 0.097 

0.095 ± 0.013 

0.510 ± 0.005 

0 012 + 0 009 
u.urz _ 0 007 

0.243 ± 0.008 

3 43 +0-07 
-0.09 

0 37 + 0 - 28 
u - 0/ -0.12 

2.98 ± 0.36 
447.2 ± 2.9 


Table 3: Correlation matrix for the statistical uncertainties in the physics fit parameters. 



| Ao | 2 

[ Asl 2 

| AJ 2 

S \\ 

<$ s ± 

< 5 + 

CT 

Ar s 

0 s 

| Ao | 2 

+ 1.00 

+ 0.19 

- 0.64 

- 0.08 

- 0.18 

- 0.02 

+ 0.38 

+ 0.70 

+ 0.11 

As 2 

— 

+ 1.00 

- 0.02 

- 0.32 

- 0.79 

- 0.10 

- 0.16 

+ 0.01 

+ 0.03 

| AJ 2 

— 

— 

+ 1.00 

- 0.27 

+ 0.03 

- 0.06 

- 0.50 

- 0.77 

- 0.11 

< S || 

— 

— 

— 

+ 1.00 

+ 0.26 

+ 0.21 

+ 0.11 

+ 0.03 

- 0.02 

^ S_L 

— 

— 

— 

— 

+ 1.00 

+ 0.06 

+ 0.11 

- 0.04 

- 0.06 


— 

— 

— 

— 

— 

+ 1.00 

+ 0.03 

+ 0.01 

+ 0.01 

CT 

— 

— 

— 

— 

— 

— 

+ 1.00 

+ 0.55 

+ 0.10 

Ar s 

— 

— 

— 

— 

— 

— 

— 

+ 1.00 

+ 0.10 

0 s 

— 

— 

— 

— 

— 

— 

— 

— 

+ 1.00 





Figure 5: The angular distributions (cos 9j, cos tpj, q>j) of the B° candidates from data (solid 
markers). The solid line is the result of the fit, the dashed line is the signal component, and the 
dot-dashed line is the background component. 

The uncertainties associated with the variables cos 6j, cpj, and cos ipj of the 3D angular ef¬ 
ficiency function are propagated to the fit results by varying the corresponding parameters 
within their statistical uncertainties, accounting for the correlations among the parameters. 
The maximum variation of the parameters extracted from the fit is taken as the systematic 
uncertainty. The systematic uncertainty owing to a small discrepancy in the kaon pj spectrum 
between data and simulation is evaluated by weighting the events to make the simulated kaon 
pj spectrum match that in data. 

The uncertainty in the ct resolution associated with the k factor is propagated to the results. A 
set of test samples is produced with the 7c(cf) factor varying within their uncertainty, assumed 
to be Gaussian. One standard deviation of the distribution describing the difference between 
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[rad] 

Figure 6: The CMS measured central value and the 68%, 90%, and 95% CL contours in the AF S 
versus (p s plane, together with the SM prediction 00. Uncertainties are statistical only. 

the ct resolution with the nominal fit and with a varying x(ct) is taken as the systematic uncer¬ 
tainty Since the 7c(cf) factor is obtained from simulation, the associated systematic uncertainty 
is assessed by using a sample of prompt J/t p decays obtained with an unbiased trigger and com¬ 
paring them to similarly processed simulated data. In this way the decay time resolution for 
ct « 0 is obtained. The x(ct) factor is varied within the values observed in data and simulation. 
The resulting variations of the physics parameters are taken as systematic uncertainties. 

Although the likelihood function makes use of a per-event mistag parameter, it does not con¬ 
tain a pdf model for the mistag distribution. The associated systematic uncertainty is estimated 
by generating simulated pseudo-experiments with different mistag distributions for signal and 
background and fitting them with the nominal fit. 

The dominant tagging systematic uncertainty originates from the assumption that the signal 
and calibration channels have the same tagging performance. It is evaluated using a calibration 
curve, obtained from simulated samples, that describes the mistag probability of B|? as function 
of the mistag probability of . The fit to the data is repeated, re-calibrating the mistag proba¬ 
bility with the B°-B ± calibration curve, and the differences found in the fit results with respect 
to the nominal fit are used as the systematic uncertainties. 

Possible biases intrinsic to the fit model are also taken into account. The nominal model func¬ 
tion is tested by using simulated pseudo-experiments, and the average of the pulls (defined as 
the difference between the result of fit to the pseudo-experiment sample and the nominal value) 
is used as a systematic uncertainty if it exceeds one standard deviation statistical uncertainty. 

The various hypotheses that have been assumed when building the likelihood function are 
tested by generating simulated pseudo-experiments with different hypotheses and fitting the 
samples with the nominal likelihood function. The obtained pull histograms of the physics 
variables are fitted with Gaussian functions, and the average of the pull is used as a system¬ 
atic uncertainty if the difference with respect to the average exceeds one standard deviation 
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7 Summary 


statistical uncertainty. Concerning the modelling of the J /ip K K invariant mass distribution, 
the background model is changed to a Chebyshev function from the nominal exponential pdf. 
The ct background pdf is changed to the sum of three exponential functions instead of the two 
exponential functions of the nominal fit. The angular background pdf is generated by using the 
background simulation angular shapes instead of the fit ones. The effect of not including the 
angular resolution is also tested, using the residual distributions obtained from simulations. 
The RMS of the angular resolutions were found to be 5.9, 6.3, and 10 mrad, for cos 6j, cos tpj, 
and (pj respectively. The contribution to the systematic uncertainty from the background tag¬ 
ging asymmetry is negligible. 

The hypothesis that |A| = 1 is tested by leaving that parameter free in the fit. The obtained 
value of | A | is consistent with 1.0 within one standard deviation. The differences found in the 
fit results with respect to the nominal fit are used as systematic uncertainties. 

The alignment systematic uncertainty affects the vertex reconstruction and therefore the decay 
times. That effect is estimated to be 1.5 jim from studies of known B hadron lifetimes It33l . The 
systematic effect owing to the very small number of B® originating from B+ —> B[? tc v feed- 
down, which would be reconstructed with large values of ct, has been found to be negligible. 

The measured values for the weak phase (p s and the decay width difference AT S are: 

cp s = —0.075 ± 0.097 (stat) ± 0.031 (syst) rad, 

AT s = 0.095 ± 0.013 (stat) ± 0.007 (syst) ps' 1 . 

The systematic uncertainties are summarised in Table [4] The uncertainties in the (p s and AT S 
results are dominated by the statistical uncertainties. 

Table 4: Summary of the uncertainties in the measurements of the various B)' 1 parameters. If 
no value is reported, then the systematic uncertainty is negligible with respect to the statistical 
and other systematic uncertainties. The total systematic uncertainty is the quadratic sum of the 
listed systematic uncertainties. 


Source of uncertainty 

(ps [rad] AT S [ps J ] 

IA.I 2 

|A S | 2 

ku 2 

<5i| [rad] <5s± [rad] & 

r [rad] 

ct [^m] 

ct efficiency 

0.002 

0.0057 

0.0015 

— 

0.0023 

— 

— 

— 

1.0 

Angular efficiency 

0.016 

0.0021 

0.0060 

0.008 

0.0104 

0.674 

0.14 

0.66 

0.8 

Kaon pj weighting 

0.014 

0.0015 

0.0094 

0.020 

0.0041 

0.085 

0.11 

0.02 

1.1 

ct resolution 

0.006 

0.0021 

0.0009 

— 

0.0008 

0.004 

— 

0.02 

2.9 

Mistag distribution modelling 

0.004 

0.0003 

0.0006 

— 

— 

0.008 

0.01 

— 

0.1 

Flavour tagging 

0.003 

0.0003 

— 

— 

— 

0.006 

0.02 

— 

— 

Model bias 

0.015 

0.0012 

0.0008 

— 

— 

0.025 

0.03 

— 

0.4 

pdf modelling assumptions 

0.006 

0.0021 

0.0016 

0.002 

0.0021 

0.010 

0.03 

0.04 

0.2 

| A | as a free parameter 

0.015 

0.0003 

0.0001 

0.005 

0.0001 

0.002 

0.01 

0.03 

— 

Tracker alignment 

— 

— 

— 

— 

— 

— 

— 

— 

1.5 

Total systematic uncertainty 

0.031 

0.0070 

0.0114 0.022 0.0116 

0.680 

0.18 

0.66 

3.7 

Statistical uncertainty 

0.097 

0.0134 

0.0053 

0.008 

0.0075 

0.081 

0.17 

0.36 

2.9 


7 Summary 

Using pp collision data collected by the CMS experiment at a centre-of-mass energy of 8 TeV 
and corresponding to an integrated luminosity of 19.7 fb \ 49200 B^ —> J/ip <^(1020) signal 
candidates were used to measure the weak phase (p s and the decay width difference AT S . The 
analysis was performed by using opposite-side lepton tagging of the B(? flavour at the produc¬ 
tion time. Both muon and electron tags were used. 
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The measured values for the weak phase and the decay width difference between the Bj? mass 
eigenstates are cp s = —0.075 ± 0.097 (stat) ± 0.031 (syst) rad and AT S = 0.095 ± 0.013 (stat) ± 
0.007 (syst) ps 1 , respectively. The measured values are consistent with those obtained by the 
LHCb Collaboration using B°—>J/ if>K 1 K decays 1(341 . 

Our measured value of (p s agrees with the SM prediction. Our result confirms AT S to be 
nonzero, with a value consistent with theoretical predictions. The uncertainties in our (p s and 
AT S measurements are dominated by statistical uncertainties. Our results provide independent 
reference measurements of (p s and AT S , and contribute to improving the overall precision of 
these quantities and thereby probing the SM further. Since our measurement precision is still 
limited by statistical uncertainty, substantial improvement is expected from LHC i/s = 13 TeV 
high-luminosity running that will be available over the next few years. 
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